SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: John E. Piletz 
Tina R» Ivanov 
(ii) TITLE OF INVENTION: DNA Sequence Encoding a Human 
Imidazoline Receptor 
(iii) NUMBER OF SEQUENCES: 22 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: WENDEROTH, LIND & PO^fACK 

(B) STREET: 805 Fifteenth St, N.W., Suite 700 

(C) CITY: Washington 

(D) STATE: District of Columbia 

(E) COUNTRY: U.S.A. 

(F) ZIP: 20005 
(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.50" 1.44 Mb diskette 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: Wordperfect 5.1+ 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: new 

(B) FILING DATE: 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NO.:. 60/012,600 
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(B) FILING DATE: March 1, 1996 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NO.: 08/650,766 

(B) FILING DATE: May 20, 1996 
(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Warren Cheek 

(B) REGISTRATION NUMBER: 33,367 

(C) REFERENCE/ DOCKET NUMBER: WMC-13 42 / clone 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 371-8850 

□ (B) TELEFAX: (202) 371-8856 
% (2) INFORMATION FOR SEQ ID NO: 1 

p (i) SEQUENCE CHARACTERISTICS: 

t: (A) LENGTH: 3 3 89 base pairs 

^-^ (B) TYPE: nucleic acid 

f; (C) STRANDEDNESS: double 

□ (D) TOPOLOGY: linear 
D (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: cDNA 

(B) CLONE: EST04033 (HFBDP28) 
(ix) FEATURE: 

(A) NAME/KEY: predicted translation product when 
transfected 

(B) LOCATION: 1398 ... 3389 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 

:TCTAGAAC TAGTGGATCC CCCGGGCTGC AGGAATTCCA GTTTAATACT AACCCTAATG 60 

STGACTGCG GTTTACAAAG AGCTCTGTAT CACCTGGGAT AGCTTTCAGT AGCAATTCAC 12 0 

\CAACTGGT CCTAAAAAAT AATAACAATA ATAATAATAA TTAGAGAATT AAAACCCAAC 18 0 

aCATGTTGA ATGGTTAAAA TCACGTAAGA ACTGAAATTT GGGGTGGGGG TGTCCTCAAC 240 

SCTGAGCTT GTCCTAGCAG TGAAAATGCT CGCCTCCAAG CAGGGCTCAG AAAGGTCTGG 3 00 

^CCCTCCAG GCAGAGGGCT GAGCTCAGGG GGCTCTTGGA GGACACTCAC CCCATGGTCC 3 60 

DGGGATGCT TCTGGCTTCC TTAAAAACAG TTGGGCATCC GCATTGTATA AGTAGGTGGA 42 0 

^CCCTAGTG TGGTTCTTTT GAAGGATATG GGAAGGGAGG ATGACGAACT AGAGAAGTGG 480 

VGGGGACCA AAATCACTGA GGTCCCAGAA TATCATAGAT TTGGGTATAG GATTGGGGTC 54 0 

:TAAGAATT GAGCACCAGG AATTCCAGCT TCTTCCCATT AAAGAAACTG GGACTGGTTT 600 

;CCTTGGAG GCCTATGTAG TGTTTTCTGC CCCTGTCCCA TACCAAGTCT CATTGATATT 660 

:T(^CAGAAT ATCAGATGAA AATCTATTTC TAAAGACCAT TGGGAGAATG GGTGGTGGAG 72 0 

^G^GTTGG AGTGGGGTTG GGGGGCAGTT AAAAATGAAT AAAAATCTCT CAGCTACAGA 780 

:CcftAACAT CACTTCCCTC CGCATTCACA GCATTTCCCA GCAGTCCCCA GATGGTTGTT 84 0 

:C(ItGGGGA CACAGCAGCT GCCTCATTTC CCTTCAGGCC CCATGGGCTG CTGGTCAACC 9 00 

:A(^ATCTA CTAAAGATGA CGCAAATGCC GACTGAACAA TCTGAAACCC AAAGGACTCG 9 60 

;G^AGACA TGTTCTGCTG AGGAGAGAAA GGTGAGCCAA GGGCAGGGCC CAGGTCCCCC 102 0 

;GCglGCCCC CGAGAGCCCG GACATGCACC TTCTGGATGT GTTTGTTCAA GTAGGACTTA 108 0 

iGCGGAAGA AGCTCCCACA TTCAGGGCAT GGGTACTTCT TCTCCCCATC AGACTCCATT 114 0 

?GTTTTTGG GGACTGCCAT GTCGCAGGAG AAAGAGCCAT TGGCACTCTG CTTCTCTGGC 12 00 

i-CTTCAGGT CGCTGGCATC TGAGAGGTCA CCATAGGAGT CAGAGCTCTC AATCGGATCC 12 60 

;ATGTGAGC ATTTCTGGCC TTCTCGGTTA CAGATACTGC AGAAGTTGCT GGGCCCCTCG 13 2 0 

^GTGCTTCT TCAGGTGGTC TGCCATGTAT GCTGCCCGCA AGTACTTCCC ACACACCTGG 13 8 0 

.GGGCACCT TGTCTTC ATG ACA GGC CAG GTG GGA GCG CAG ACG GTC TCG 14 3 0 

Met Thr Gly Gin Val Gly Ala Gin Thr Val Ser 
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GGC AAA AGA AGC ATT GCA GGT CTG AC A CTT GTG AGG CCG CTC AGA 14 7 8 

Ly Gly Lys Arg Ser lie Ala Gly Leu Thr Leu Val Arg Pro Leu Arg 
15 20 25 

;T GTG CAC CTG CTT GAT ATG TCC GTT CAA GTG ATC AGG CCT GGA GAA 152 6 

Val His Leu Leu Asp Met Ser Val Gin Val lie Arg Pro Gly Glu 
30 35 40 

:C,jrTT CCC ACA GCT CTG GCA gat GTA AGG CGG AAT TCC CCA GAG AAG 1574 

La-JPhe Pro Thr Ala Leu Ala Asp Val Arg Trp Asn Ser Pro Glu Lys 

^5 50 55 

ill 

^GIi^GT GGT GAA GAC TCC CGG CTC TCA GCT GCC CCC TGC ATC AGA CCC 1622 

^si^ly Gly Glu Asp Ser Trp Leu Ser Ala Ala Pro Cys lie Arg Pro 

) □ 65 70 75 

SC'^GC TCC CCT CCC ACT GTG GCT CCC GCA TCT GCC TCC CTG CCC CAG 167 0 

jr Ser Ser Pro Pro Thr Val Ala Pro Ala Ser Ala Ser Leu Pro Gin 

80 85 90 



:C ATC CTC TCT AAC CAA GGA ATC ATG TTC GTT CAG GAG GAG GCC CTG 1718 
:o lie Leu Ser Asn Gin Gly lie Met Phe Val Gin Glu Glu Ala Leu 
95 100 105 
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CC AGC AGC CTC TCG TCC ACT GAG AGT CTG ACT CCC GAG CAC CAG CCC 17 66 

la Ser Ser Leu Ser Ser Thr Asp Ser Leu Thr Pro Glu His Gin Pro 
110 115 120 

TT GCC CAG GGA TGT TCT GAT TCC TTG GAG TCC ATC CCT GCG GGA CAG 1814 
le Ala Gin Gly Cys Ser Asp Ser Leu Glu Ser lie Pro Ala Gly Gin 
125 130 135 

CA GCT TCC GAT GAT TTA AGG GAC GTG CCA GGA GCT GTT GGT GGT GCA 1862 
la Ala Ser Asp Asp Leu Arg Asp Val Pro Gly Ala Val Gly Gly Ala 
4Q^, 145 150 155 

GG^^'CCA GAA CAT GCC GAG CCG GAG GTC CAG GTG GTG CCG GGG TCT GGC 1910 
edJJpro Glu His Ala Glu Pro Glu Val Gin Val Val Pro Gly Ser Gly 
y 160 165 170 

Aq^lATC ATC TTC CTG CCC TTC ACC TGC ATT GGC TAG ACG GCC, ACC AAT 1958 

iB 

lAfiTle lie Phe Leu Pro Phe Thr Cys lie Gly Tyr Thr Ala Thr Asn 
175 180 185 

AG GAC TTC ATC CAG CGC CTG AGC ACA CTG ATC CGG CAG GCC ATC GAG 2 006 

In Asp Phe lie Gin Arg Leu Ser Thr Leu lie Trp Gin Ala lie Glu 
190 195 200 

3G CAG CTG CCT GCC TGG ATC GAG GCT GCC AAC CAG CGG GAG GAG GGC 2 054 

rp Gin Leu Pro Ala Trp lie Glu Ala Ala Asn Gin Trp Glu Glu Gly 
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205 210 215 

GGT GAA CAG GGC GAG GAG GAG GAT GAG GAG GAG GAA GAA GAG GAG 2102 

In Gly Glu Gin Gly Glu Glu Glu Asp Glu Glu Glu Glu Glu Glu Glu 

20 225 230 235 

GTG GGT GAG AAC CGC TAG TTT GAA ATG GGG CCC CCA GAC GTG GAG 2150 
sp Val Ala Glu Asn Arg Tyr Phe Glu Met Gly Pro Pro Asp Val* Glu 

240 245 250 

AGd-GAG GAG GGA GGA GGC CAG GGG GAG GAA GAG GAG GAG GAA GAG GAG 2198 
lu";blu Glu Gly Gly Gly Gin Gly Glu Glu Glu Glu Glu Glu Glu Glu 
^ 255 260 265 

?^T^^bAA GAG GCC GAG GAG GAG CGC CTG GCT CTG GAA TGG GCC CTG GGC 2 2 46 

spt^^lu Glu Ala Glu Glu Glu Arg Leu Ala Leu Glu Trp Ala Leu Gly 
□ 270 275 280 

Cg"g AC GAG GAC TTC CTG CTG GAG CAC ATC CGC ATC CTC AAG GTG CTG 2 294 

la Asp Glu Asp Phe Leu Leu Glu His lie Arg lie Leu Lys Val Leu 
285 290 295 

2G TGC TTC CTG ATC CAT GTG CAG GGC AGT ATC CGC CAG TTC GCC GCC 2 34 2 

rp Cys Phe Leu lie His Val Gin Gly Ser lie Arg Gin Phe Ala Ala 
DO 305 310 315 
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GC CTT GTG CTC ACC GAC TTC GGC ATC GCA GTC TTC GAG ATC CCG CAC 2 3 90 

ys Leu Val Leu Thr Asp Phe Gly lie Ala Val Phe Glu lie Pro His 

320 325 330 

AG GAG TCT CGG GGC AGC AGC CAG CAC ATC CTC TCC TCC CTG CGC TTT 2 438 

In Glu Ser Trp Gly Ser Ser Gin His lie Leu Ser Ser Leu Arg Phe 
335 340 345 

;TC TTT TGC TTC CCG CAT GGC GAC CTC ACC GAG TTT GGC TTC CTC ATG 248 6 

'al Phe Cys Phe Pro His Gly Asp Leu Thr Glu Phe Gly Phe Leu Met 

350 355 360 

'XI 

:CC^GAG CTG TGT CTG GTG CTC AAG GTA CGG CAC AGT GAG AAC ACG CTC 253 4 

•rdfflGlu Leu Cys Leu Val Leu Lys Val Arg His Ser Glu Asn Thr Leu 

1x1365 370 375 

'TciPiATT ATC TCG GAC GCC GCC AAC CTG CAC GAG TTC CAC GCG GAC CTG 2 582 

'h4f|Ile lie Ser Asp Ala Ala Asn Leu His Glu Phe His Ala Asp Leu 
8Cr 385 390 395 

:GC TCA TGC TTT GCA CCC CAG CAC ATG GCC ATG CTG TGT AGC CCC ATC 2 63 0 

.rg Ser Cys Phe Ala Pro Gin His Met Ala Met Leu Cys Ser Pro lie 

400 405 410 

:TC TAC GGC AGC CAC ACC AGC CTG CAG GAG TTC CTG CGC CAG CTG CTC 2 678 

,eu Tyr Gly Ser His Thr Ser Leu Gin Glu Phe Leu Arg Gin Leu Leu 
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415 420 425 

2C TTC TAG AAG GTG OCT GGC GGC TGC CAG GAG CGC AGC CAG GGC TGC 27 2 6 

nr Phe Tyr Lys Val Ala Gly Gly Cys Gin Glu Arg Ser Gin Gly Cys 
430 435 440 

rC CCC GTC TAG GTG GTG TAG AGT GAG AAG CGG ATG GTG GAG AGG GGG 2 774 

le Pro Val Tyr Leu Val Tyr Ser Asp Lys Arg Met Val Gin Thr Ala 
445 450 . 455 

2C GGG GAG TAG TCA GGG AAG ATG GAG TGG GGG AGC TGG AGA GTC TGT 2 82 2 

l-a:J.bly Asp Tyr Ser Gly Asn lie Glu Trp Ala Ser Gys Thr Leu Cys 
50:^: 465 470 475 

m 

:A'=P;GG GTG GGG GGG TCC TGC TGC GGG GCG TGT gag GCG GTG AAG TGC 2 87 0 

ill 

srsiAla Val Arg Arg Ser Gys Gys Ala Pro Ser Glu Ala Val Lys Ser 
O 480 485 490 



:G,^GG ATG GGG TAG TGG GTG TTG GTC AGG GCG GAG GAG GTG AAG GTG 2918 
La Ala lie Pro Tyr Trp Leu Leu Leu Thr Pro Gin His Leu Asn Val 
495 500 505 

'C AAG GGC, GAG TTG AAG CGC ATG GCG AAC GGT GGC AGC CAG AAG TGT 2 966 

.e Lys Ala Asp Phe Asn Pro Met Pro Asn Arg Gly Thr His Asn Gys 
510 515 520 
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;C AAC CGC AAC AGC TTC AAG CTC AGC CGT GTG CCG CTC TCC ACC GTG 3 014 

:g Asn Arg Asn Ser PHe Lys Leu Ser Arg Val Pro Leu Ser Thr Val 
525 530 535 

?G CTG GAC CCC ACA CGC AGC TGT ACC CAG CCT CGG GGC GCC TTT GCT 3 062 

iu Leu Asp Pro Thr Arg Ser Cys Thr Gin Pro Arg Gly Ala Phe Ala 
:0 545 550 555 

iT GGC CAC GTG CTA GAG CTG CTC GTG GGG TAC CGC TTT GTC ACT GCC 3110 
;p Gly His Val Leu Glu Leu Leu Val Gly Tyr Arg Phe Val Thr Ala 
560 565 570 

^C:ftTC GTG CTG CCC CAC GAG AAG TTC CAC TTC CTG CGC GTC TAC AAC 3158 
.e#he Val Leu Pro His Glu Lys Phe His Phe Leu Arg Val Tyr Asn 
=P 575 580 585 

.GjjTG CGG GCC TCG CTG CAG GAC CTG AAG ACT GTG GTC ATC GCC AAG 3 2 06 

n^Jieu Arg Ala Ser Leu Gin Asp Leu Lys Thr Val Val lie Ala Lys 
590 595 600 

C CCC GGG ACG GGA GGC AGC CCC CAG GGC TCC TTT GCG GAT GGC CAG 3 2 54 

r Pro Gly Thr Gly Gly Ser Pro Gin Gly Ser Phe Ala Asp Gly Gin 
605 610 615 

T GCC GAG CGC AGG GCC AGC AAT GAC CAG CGT CCC CAG GAG GTC CCA 3 3 02 

o Ala Glu Arg Arg Ala Ser Asn Asp Gin Arg Pro Gin Glu Val Pro 

61 




20 625 630 635 

CA GAG GCT CTG GCC CCG GCC CCA GTG GAA GTC CCA GCT CCA GCC CCG 3 3 50 

la Glu Ala Leu Ala Pro Ala Pro Val Glu Val Pro Ala Pro Ala Pro 

640 645 650 

AA TTC GAT ATC AAG CTT ATC GAT ACC GTC GAC CTG CAG 3 3 89 

lu Phe Asp lie Lys Leu lie Asp Thr Val Asp Leu Gin 
655 660 664 

2) INFORMATION FOR SEQ ID NO: 2 

(i) SEQUENCE CHARACTERISTICS: 

Mi (A) LENGTH: 1954 base pairs 

m (B) TYPE: nucleic acid 

^] (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 2 
TdScAGGCC AGGTGGGAGC GCAGACGGTC TCGGGTGGCA AAAGAAGCAT TGCAGGTCTG 60 
CA0TTGTGA GGCCGCTCAG AAGTGTGCAC CTGCTTGATA TGTCCGTTCA AGTGATCAGG 12 0 
CTGGAGAAG CCTTTCCCAC AGCTCTGGCA GATGTAAGGC GGAATTCCCC AGAGAAGAAG 180 
3TGGTGAAG ACTCCCGGCT CTCAGCTGCC CCCTGCATCA GACCCAGCAG CTCCCCTCCC 2 40 
CTGTGGCTC CCGCATCTGC CTCCCTGCCC CAGCCCATCC TCTCTAACCA AGGAATCATG 3 00 
rCGTTCAGG AGGAGGCCCT GGCCAGCAGC CTCTCGTCCA CTGACAGTCT GACTCCCGAG 3 60 
\CCAGCCCA TTGCCCAGGG ATGTTCTGAT TCCTTGGAGT CCATCCCTGC GGGACAGGCA 4 20 
3TTCCGATG ATTTAAGGGA CGTGCCAGGA GCTGTTGGTG GTGCAAGCCC AGAACATGCC 4 80 
^GCCGGAGG TCCAGGTGGT GCCGGGGTCT GGCCAGATCA TCTTCCTGCC CTTCACCTGC 540 
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ttggctaca cggccaccaa tcaggacttc atccagcgcc tgagcacact gatccggcag 
ccatcgagc ggcagctgcc tgcctggatc gaggctgcca accagcggga ggagggccag 
gtgaacagg gcgaggagga ggatgaggag gaggaagaag aggaggacgt ggctgagaac 
gctactttg aaatggggcc cccagacgtg gaggaggagg agggaggagg ccagggggag 
aagaggagg aggaagagga ggatgaagag gccgaggagg agcgcctggc tctggaatgg 
ccctgggcg cggacgagga cttcctgctg gagcacatcc gcatcctcaa ggtgctgtgg 
gcttcctga tccatgtgca gggcagtatc cgccagttcg ccgcctgcct tgtgctcacc 
acttcggca tcgcagtctt cgagatcccg caccaggagt ctcggggcag cagccagcac 
tcctctcct ccctgcgctt tgtcttttgc ttcccgcatg gcgacctcac cgagtttggc 
tcctcatgc cggagctgtg tctggtgctc aaggtacggc acagtgagaa cacgctcttc 
tt^tctcgg acgccgccaa cctgcacgag ttccacgcgg acctgcgctc atgctttgca 
ccikccaca tggccatgct gtgtagcccc atcctctacg gcagccacac cagcctgcag 
agfftcctgc gccagctgct caccttctac aaggtggctg gcggctgcca ggagcgcagc 
agSgctgct tccccgtcta cctggtctac agtgacaagc gcatggtgca gacggccgcc 
ggis.^ctact caggcaacat cgagtgggcc agctgcacac tctgttcagc cgtgcggcgc 
cqbgctgcg cgccctctga ggccgtcaag tccgccgcca tcccctactg gctgttgctc 
cogcccagc acctcaacgt catcaaggcc gacttcaacc ccatgcccaa ccgtggcacc 
acj^ctgtc gcaaccgcaa cagcttcaag ctcagccgtg tgccgctctc caccgtgctg 
tg^kcccca cacgcagctg tacccagcct cggggcgcct ttgctgatgg ccacgtgcta 
agctgctcg tggggtaccg ctttgtcact gccatcttcg tgctgcccca cgagaagttc 
acttcctgc gcgtctacaa ccagctgcgg gcctcgctgc aggacctgaa gactgtggtc 
tcgccaaga cccccgggac gggaggcagc ccccagggct cctttgcgga tggccagcct 
ccgagcgca gggccagcaa tgaccagcgt gccgaggagg tcccagcaga ggctctggcc 
cggccccag tggaagtccc agctccagcc ccgg 



600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1954 
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3) INFORMATION FOR SEQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3 



ATTCCAGTT 


TAATACTAAC 


CCTAATGTGT 


GACTGCGGTT 


TACAAAGAGC 


TCTGTATCAC 


60 


:tgggatagc 


TTTCAGTAGC 


AATTCACTAC 


AACTGGTCCT 


AAAAAATAAT 


AACAATAATA 


120 


lTAATAATTA 


GAGAATTAAA 


ACCCAACAGC 


ATGTTGAATG 


GTTAAAATCA 


CGTAAGAACT 


180 


iaaatttggg 

:Tq|lAAGCAG 


GTGGGGGTGT 


CCTCAACAGC 


TGAGCTTGTC 


CTAGCAGTGA 


AAATGCTCGC 


240 


GGCTCAGAAA 


GGTCTGGAGC 


CCTCCAGGCA 


GAGGGCTGAG 


CTCAGGGGGC 


300 


?ct1ggagga 


CACTCACCCC 


ATGGTCCATG 


GGATGCTTCT 


GGCTTCCTTA 


AAAACAGTTG 


360 


;Gq^TCCGCA 


TTGTATAAGT 


AGGTGGAGAC 


CCTAGTGTGG 


TTCTTTTGAA 


GGATATGGGA 


420 


vGG^AGGATG 


ACGAACTAGA 


GAAGTGGGAG 


GG.GACCAAAA 


TCACTGAGGT 


CCCAGAATAT 


480 


:a'^agatttg 


GGTATAGGAT 


TGGGGTCACT 


AAGAATTGAG 


CACCAGGAAT 


TCCAGCTTCT 


540 


?ccSIattaaa 


GAAACTGGGA 


CTGGTTTTGC 


CTTGGAGGCC 


TATGTAGTGT 


TTTCTGCCCC 


600 


^gtSccatac 


CAAGTCTCAT 


TGATATTTCT 


GCAGAATATC 


AGATGAAAAT 


CTATTTCTAA 


660 


iGAHCATTGG 


GAGAATGGGT 


GGTGGAGAAG 


GAGTTGGAGT 


GGGGTTGGGG 


GGCAGTTAAA 


720 


iATGAATAAA 


AATCTCTCAG 


CTACAGAACC 


CAAACATCAC 


TTCCCTCCGC 


ATTCACAGCA 


780 


?TTCCCAGCA 


GTCCCCAGAT 


GGTTGTTTCC 


GTGGGGACAC 


AGCAGCTGCC 


TCATTTCCCT 


840 


:'CAGGCCCCA 


TGGGCTGCTG 


GTCAACCTCA 


GGATCTACTA 


AAGATGACGC 


AAATGCCGAC 


900 


^GAACAATCT 


GAAACCCAAA 


GGACTCGAGG 


AGAGACATGT 


TCTGCTGAGG 


AGAGAAAGGT 


960 


JAGCCAAGGG 


CAGGGCCCAG 


GTCCCCCAGG 


GGGCCCCCGA 


GAGCCCGGAC 


ATGCACCTTC 


1020 


^GGATGTGTT 


TGTTCAAGTA 


GGACTTAGAG 


CGGAAGAAGC 


TCCCACATTC 


AGGGCATGGG 


1080 


.^ACTTCTTCT 


CCCCATCAGA 


CTCCATTTTG 


TTTTTGGGGA 


CTGCCATGTC 


GCAGGAGAAA 


1140 
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^GCCATTGG CACTCTGCTT CTCTGGCGTC TTCAGGTCGC TGGCATCTGA GAGGTCACCA 12 0 0 

\GGAGTCAG AGCTCTCAAT CGGATCCTGA TGTGAGCATT TCTGGCCTTC TCGGTTACAG 12 60 

TACTGCAGA AGTTGCTGGG CCCCTCGCTG TGCTTCTTCA GGTGGTCTGC CATGTATGCT 13 2 0 

:CCGCAAGT ACTTCCCACA CACCTGGCAG GGCACCTTGT CTTCATGACA GGCCAGGTGG 13 8 0 

\GCGCAGAC GGTCTCGGGT GGCAAAAGAA GCATTGCAGG TCTGACACTT GTGAGGCCGC 14 4 0 

:AGAAGTGT GCACCTGCTT GATATGTCCG TTCAAGTGAT CAGGCCTGGA GAAGCCTTTC 1500 

:ACAGCTCT GGCAGATGTA AGGCGGAATT CCCCAGAGAA GAAGGGTGGT GAAGACTCCC 15 60 

SCTCTCAGC TGCCCCCTGC ATCAGACCCA GCAGCTCCCC TCCCACTGTG GCTCCCGCAT 162 0 

TGCCTCCCT GCCCCAGCCC ATCCTCTCTA ACCAAGGAAT CATGTTCGTT CAGGAGGAGG 168 0 

:CTGGCCAG CAGCCTCTCG TCCACTGACA GTCTGACTCC CGAGCACCAG CCCATTGCCC 17 4 0 

2GGATGTTC TGATTCCTTG GAGTCCATCC CTGCGGGACA GGCAGCTTCC GATGATTTAA 18 00 

3Gi|pGTGCC AGGAGCTGTT GGTGGTGCAA GCCCAGAACA TGCCGAGCCG GAGGTCCAGG 18 60 

ZcfgCCGGG GTCTGGCCAG ATCATCTTCC TGCCCTTCAC CTGCATTGGC TACACGGCCA 192 0 

srs 

:a|^CAGGA CTTCATCCAG CGCCTGAGCA CACTGATCCG GCAGGCCATC GAGCGGCAGC 198 0 

;CCTGCCTG GATCGAGGCT GCCAACCAGC GGGAGGAGGG CCAGGGTGAA CAGGGCGAGd 2 04 0 

^GfGGATGA GGAGGAGGAA GAAGAGGAGG ACGTGGCTGA GAACCGCTAC TTTGAAATGG 2100 

;cSfccCAGA CGTGGAGGAG GAGGAGGGAG GAGGCCAGGG GGAGGAAGAG GAGGAGGAAG 2160 

SGi^GATGA AGAGGCCGAG GAGGAGCGCC TGGCTCTGGA ATGGGCCCTG GGCGCGGACG 22 2 0 

SG^TTCCT GCTGGAGCAC ATCCGCATCC TCAAGGTGCT GTGGTGCTTC CTGATCCATG 2 2 80 

JCAGGGCAG TATCCGCCAG TTCGCCGCCT GCCTTGTGCT CACCGACTTC GGCATCGCAG 2 340 

:TTCGAGAT CCCGCACCAG GAGTCTCGGG GCAGCAGCCA GCACATCCTC TCCTCCCTGC 2 4 00 

:TTTGTCTT TTGCTTCCCG CATGGCGACC TCACCGAGTT TGGCTTCCTC ATGCCGGAGC 24 60 

;TGTCTGGT GCTCAAGGTA CGGCACAGTG AGAACACGCT CTTCATTATC TCGGACGCCG 2 52 0 

:AACCTGCA CGAGTTCCAC GCGGACCTGC GCTCATGCTT TGCACCCCAG CACATGGCCA 2 58 0 

;CTGTGTAG CCCCATCCTC TACGGCAGCC ACACCAGCCT GCAGGAGTTC CTGCGCCAGC 2 64 0 

;CTCACCTT CTACAAGGTG GCTGGCGGCT GCCAGGAGCG CAGCCAGGGC TGCTTCCCCG 27 00 
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TACCTGGT 


CTACAGTGAC 


AAGCGCATGG 


TGCAGACGGC 


CGCCGGGGAC 


TACTCAGGCA 


2760 


ATCGAGTG 


GGCCAGCTGC 


ACACTCTGTT 


CAGCCGTGCG 


GCGCTCCTGC 


TGCGCGCCCT 


2820 


GAGGCCGT 


CAAGTCCGCC 


GCCATCCCCT 


ACTGGCTGTT 


GCTCACGCCC 


CAGCACCTCA 


2880 


GTCATCAA 


GGCCGACTTC 


AACCCCATGC 


CCAACCGTGG 


CACCCACAAC 


TGTCGCAACC 


2940 


AACAGCTT 


CAAGCTCAGC 


CGTGTGCCGC 


TCTCCACCGT 


GCTGCTGGAC 


CCCACACGCA 


3000 


TGTACCCA 


GCCTCGGGGC 


GCCTTTGCTG 


ATGGCCACGT 


GCTAGAGCTG 


CTCGTGGGGT 


3060 


CGCTTTGT 


CACTGCCATC 


TTCGTGCTGC 


CCCACGAGAA 


GTTCCACTTC 


CTGCGCGTCT 


3120 


AACCAGCT 


GCGGGCCTCG 


CTGCAGGACC 


TGAAGACTGT 


GGTCATCGCC 


AAGACCCCCG 


3180 


ACGGGAGG 


CAGCCCCCAG 


GGCTCCTTTG 


CGGATGGCCA 


GCCTGCCGAG 


CGCAGGGCCA 


3240 


AATGACCA 


GCGTCCCCAG 


GAGGTCCCAG 


CAGAGGCTCT 


GGCCCCGGCC 


CCAGTGGAAG 


3300 


CCAGCTCC 


AGCCCCGG 










3318 
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) INFORMATION FOR SEQ ID NO: 4 
iTsCi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 
J3 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
^0 (D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
SGAGGAGG AAGAGGAGGA TGAAGAGGCC GAGGAGGAGC GCCTGGCTCT GGAATGGGCC 60 
3GGCGCGG ACGAGGACTT CCTGCTGGAG CACATCCGCA TCCTCAAGGT GCTGTGGTGC 12 0 
-CTGATCC ATGTGCAGGG CAGTATCCGC CAGTTCGCCG CCTGCCTTGT GCTCACCGAC 18 0 
:GGCATCG CAGTCTTCGA GATCCCGCAC CAGGAGTCTC GGGGCAGCAG CCAGCACATC 24 0 
:TCCTCCC TGCGCTTTGT CTTTTGCTTC CCGCATGGCG ACCTCACCGA GTTTGGCTTC 3 00 
:ATGCCGG AGCTGTGTCT GGTGCTCAAG GTACGGCACA GTGAGAACAC GCTCTTCATT 3 60 
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'CTCGGACG 


CCGCCAACCT 


GCACGAGTTC 


CACGCGGACC 


TGCGCTCATG 


CTTTGCACCC 


420 


.GCACATGG 


CCATGCTGTG 


TAGCCCCATC 


CTCTACGGCA 


GCCACACCAG 


CCTGCAGGAG 


480 


'CCTGCGCC 


AGCTGCTCAC 


CTTCTACAAG 


GTGGCTGGCG 


GCTGCCAGGA 


GCGCAGCCAG 


540 


;CTGCTTCC 


CCGTCTACCT 


GGTCTACAGT 


GACAAGCGCA 


TGGTGCAGAC 


GGCCGCCGGG 


600 


.CTACTCAG 


GCAACATCGA 


GTGGGCCAGC 


TGCACACTCT 


GTTCAGCCGT 


GCGGCGCTCC 


660 


;CTGCGCGC 


CCTCTGAGGC 


CGTCAAGTCC 


GCCGCCATCC 


CCTACTGGCT 


GTTGCTCACG 


720 


:CCAGCACC 


TCAACGTCAT 


CAAGGCCGAC 


TTCAACCCCA 


TGCCCAACCG 


TGGCACCCAC 


780 


CTGTCGCA 


ACCGCAACAG 


CTTCAAGCTC 


AGCCGTGTGC 


CGCTCTCCAC 


CGTGCTGCTG 


840 


.CCCCACAC 


GCAGCTGTAC 


CCAGCCTCGG 


GGCGCCTTTG 


CTGATGGCCA 


CGTGCTAGAG 


900 


GCTCGTGG 


GGTACCGCTT 


TGTCACTGCC 


ATCTTCGTGC 


TGCCCCACGA 


GAAGTTCCAC 


960 


CCTGCGCG 
CA&ACCC 


TCTACAACCA 


GCTGCGGGCC 


TCGCTGCAGG 


ACCTGAAGAC 


TGTGGTCATC 


1020 


CCGGGACGGG 


AGGCAGCCCC 


CAGGGCTCCT 


TTGCGGATGG 


CCAGCCTGCC 


1080 


GC]®CAGGG 


CCAGCAATGA 


CCAGCGTCCC 


CAGGAGGTCC 


CAGCAGAGGC 


TCTGGCCCCG 


1140 


CC&VGTGG 

: s ; 


AAGTCCCAGC 


TCCAGCCCCG 


G 
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U (5) INFORMATION FOR SEQ ID NO : 5 

ffl (i) SEQUENCE CHARACTERISTICS: 

=0 (A) LENGTH: 651 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 5 

Met Thr Gly Gin Val Gly Ala Gin Thr Val Ser 
1 . 5 10 

Gly Gly Lys Arg Ser lie Ala Gly Leu Thr Leu Val Arg Pro Leu Arg 
15 20 25 
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Ser Val His Leu Leu Asp Met Ser 
30 35 



Ala Phe Pro Thr Ala Leu Ala Asp 
45 50 



Lys Gly Gly Glu Asp Ser Trp Leu 
60 65 



Ser Ser Ser Pro Pro Thr Val Ala 

80 



Val Gin Val lie Arg Pro Gly Glu 

40 

Val Arg Trp Asn Ser Pro Glu Lys 
55 

Ser Ala Ala Pro Cys lie Arg Pro 
70 75 

Pro Ala Ser Ala Ser Leu Pro Gin 
85 90. 



Pro lie Leu Ser Asn Gin Gly lie Met Phe Val Gin Glu Glu Ala Leu 
95 100 105 

=1 Ala Ser Ser Leu Ser Ser Thr Asp Ser Leu Thr Pro Glu His Gin Pro 
Q 110 115 120 

F- lie Ala Gin Gly Cys Ser Asp Ser Leu Glu Ser lie Pro Ala Gly Gin 
fJ 125 130 135 

"-'^ Ala Ala Ser Asp Asp Leu Arg Asp Val Pro Gly Ala Val Gly Gly Ala 

^. 140 145 150 155 

3 Ser Pro Glu His Ala Glu Pro Glu Val Gin Val Val Pro Gly Ser Gly 
y 160 165 170 

Gin lie lie Phe Leu Pro Phe Thr Cys lie Gly Tyr Thr Ala Thr Asn 
175 180 185 

Gin Asp Phe lie Gin Arg Leu Ser Thr Leu lie Trp Gin Ala lie Glu 
190 195 200 

Trp Gin Leu Pro Ala Trp lie Glu Ala Ala Asn Gin Trp Glu Glu Gly 
205 210 215 

Gin Gly Glu Gin Gly Glu Glu Glu Asp Glu Glu Glu Glu Glu Glu Glu 
220 225 230 235 
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Asp Val Ala Glu Asn Arg Tyr Phe Glu Met Gly Pro Pro Asp Val Glu 

240 ^ 245 250 

Glu Glu Glu Gly Gly Gly Gin Gly Glu Glu Glu Glu Glu Glu Glu Glu 
255 260 265 

Asp Glu Glu Ala Glu Glu Glu Arg Leu Ala Leu Glu Trp Ala Leu Gly 
270 275 280 

Ala Asp Glu Asp Phe Leu Leu Glu His lie Arg lie Leu Lys Val Leu 
285 290 295 

Trp Cys Phe Leu lie His Val Gin Gly Ser lie Arg Gin Phe Ala Ala 
300 305 310 315 

.^^ Cys Leu Val Leu Thr Asp Phe Gly lie Ala Val Phe Glu lie Pro His 

320 325 330 

"^^P Ser Gin His lie Leu Ser Ser Leu Arg Phe 

:t 335 340 345 

y Val Phe Cys Phe Pro His Gly Asp Leu Thr Glu Phe Gly Phe Leu Met 
350 355 360 

O '^^^ Cys Leu Val Leu Lys Val Arg His Ser Glu Asn Thr Leu 

iTj 365 370 375 

Phe lie lie Ser Asp Ala Ala Asn Leu His Glu Phe His Ala Asp Leu 
380 385 390 395 

Arg Ser Cys Phe Ala Pro Gin His Met Ala Met Leu Cys Ser Pro lie 

400 405 410 

Leu Tyr Gly Ser His Thr Ser Leu Gin Glu Phe Leu Arg Gin Leu Leu 
415 420 425 



Thr Phe Tyr Lys Val Ala Gly Gly Cys Gin Glu Arg Ser Gin Gly Cys 
430 435 440 
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Phe Pro Val Tyr Leu Val Tyr Ser Asp Lys Arg Met Val Gin Thr Ala 
445 450 455 

Ala Gly Asp .Tyr Ser Gly Asn lie Glu Trp Ala Ser Cys Thr Leu Cys 
460 465 470 475 

Ser Ala Val Arg Arg Ser Cys Cys Ala Pro Ser Glu Ala Val Lys Ser 

480 485 490 

Ala Ala lie Pro Tyr Trp Leu Leu Leu Thr Pro Gin His Leu Asn Val 
495 500 505 

lie Lys Ala Asp Phe Asn Pro Met Pro Asn Arg Gly Thr His Asn Cys 
510 515 520 



Arg Asn Arg Asn Ser Phe Lys Leu Ser Arg Val Pro Leu Ser Thr Val 
13 52 5 53 0 53 5 

U Leu Leu Asp Pro Thr Arg Ser Cys Thr Gin Pro Arg Gly Ala Phe Ala 
=(: 540 545 550 555 

ijl 

ill Asp Gly His Val Leu Glu Leu Leu Val Gly Tyr Arg Phe Val Thr Ala 
. 560 565 570 

;z: lie Phe Val Leu Pro His Glu Lys Phe His Phe Leu Arg Val Tyr Asn 
:nr 575 580 585 

^1=? Gin Leu Arg Ala Ser Leu Gin Asp Leu Lys Thr Val Val lie Ala Lys 
590 595 600 



Thr Pro Gly Thr Gly Gly Ser Pro Gin Gly Ser Phe Ala Asp Gly Gin 
605 610 615 

Pro Ala Glu Arg Arg Ala Ser Asn Asp Gin Arg Pro Gin Glu Val Pro 
620 625 630 635 

Ala Glu Ala Leu Ala Pro Ala Pro Val Glu Val Pro Ala Pro Ala Pro 

640 645 650 
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(6) INFORMATION FOR SEQ ID NO: 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 390 amino acids 

(B) TYPE: polypeptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 6 

Glu Glu Glu Glu Glu Glu 
1 5 

Asp Glu Glu Ala Glu Glu Glu Arg Leu Ala Leu Glu Trp Ala Leu Gly 
10 15 20 

Ala Asp Glu Asp Phe Leu Leu Glu His lie Arg lie Leu Lys Val Leu 
25 30 35 

Trp Cys Phe Leu lie His Val Gin Gly Ser lie Arg Gin Phe Ala Ala 

U 40 4 5 50 

?- 

-C Thr Asp Phe Gly lie Ala Val Phe Glu lie Pro His 

M ^5 60 ' 65 70 

]Z Gin Glu Ser Trp Gly Ser Ser Gin His lie Leu Ser Ser Leu Arg Phe 
]~J 75 80 85 

Val Phe Cys Phe Pro His Gly Asp Leu Thr Glu Phe Gly Phe Leu Met 
90 95 100 

Pro Glu Leu Cys Leu Val Leu Lys Val Arg His Ser Glu Asn Thr Leu 
105 110 115 

Phe lie lie Ser Asp Ala Ala Asn Leu His Glu Phe His Ala Asp Leu 
120 125 130 

Arg Ser Cys Phe Ala Pro Gin His Met Ala Met Leu Cys Ser Pro lie 
135 140 145 150 

Leu Tyr Gly Ser His Thr Ser Leu Gin Glu Phe Leu Arg Gin Leu Leu 
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155 160 165 

Thr Phe Tyr Lys Val Ala Gly Gly Cys Gin Glu Arg Ser Gin Gly Cys 
170 175 180 

Phe Pro Val Tyr Leu Val Tyr Ser Asp Lys Arg Met Val Gin Thr Ala 
185 190 195 

Ala Gly Asp Tyr Ser Gly Asn lie Glu Trp Ala Ser Cys Thr Leu Cys 
200 205 210 

Ser Ala Val Arg Arg Ser Cys Cys Ala Pro Ser Glu Ala Val Lys Ser 
215 220 225 230 

Ala Ala lie Pro Tyr Trp Leu Leu Leu Thr Pro Gin His Leu Asn Val 

235 240 245 

lie Lys Ala Asp Phe Asn Pro Met Pro Asn Arg Gly Thr His Asn Cys 
250 255 260 



=E Arg Asn Arg Asn Ser Phe Lys Leu Ser Arg Val Pro Leu Ser Thr Val 
a 265 270 275 

Leu Leu Asp Pro Thr Arg Ser Cys Thr Gin Pro Arg Gly Ala Phe Ala 

% 280 285 290 

!f Asp Gly His Val Leu Glu Leu Leu Val Gly Tyr Arg Phe Val Thr Ala 

295 300 305 310 

lie Phe Val Leu Pro His Glu Lys Phe His Phe Leu Arg Val Tyr Asn 

315 320 325 

Gin Leu Arg Ala Ser Leu Gin Asp Leu Lys Thr Val Val lie Ala Lys 
330 335 340 

Thr Pro Gly Thr Gly Gly Ser Pro Gin Gly Ser Phe Ala Asp Gly Gin 
345 350 355 

Pro Ala Glu Arg Arg Ala Ser Asn Asp Gin Arg Pro Gin Glu Val Pro 
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360 365 370 

Ala Glu Ala Leu Ala Pro Ala Pro Val Glu Val Pro Ala Pro Ala Pro 
375 380 385 390 



(7) INFORMATION FOR SEQ ID NO: 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 7 
CTTGAGGATG CGGATGTGCT 2 0 

(8) INFORMATION FOR SEQ ID NO: 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
CCATGGGGTG AGTGTCCT 18 

(9) INFORMATION FOR SEQ ID NO: 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO 
AGGACACTCA CCCCATGG 18 

(10) INFORMATION FOR SEQ ID NO: 10 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO 
GTATGGGACA GGGGCAG AAA 2 0 



(11) INFORMATION FOR SEQ ID NO: 11 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO 
TTTCTAAAGA CCATTGGGAG 2 0 

(12) INFORMATION FOR SEQ ID NO: 12 
(i) SEQUENCE CHARACTERISTICS: 





(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
CCATTTTAAA GTAGCGGTTC 2 0 

(13) INFORMATION FOR SEQ ID NO: 13 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 13 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GTAGATCCTG AGGTTGACCA 2 0 

(15) INFORMATION FOR SEQ ID NO: 15 
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m AGGAGAGAAA GGTGAGCCAA 



20 



(14) INFORMATION FOR SEQ ID NO: 14 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(P) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TGTGAGCATT TCTGGCCTTC 2 0 



(16) INFORMATION FOR SEQ ID NO: 16 
^ (i) SEQUENCE CHARACTERISTICS: 

^0 (A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 
m (C) STRANDEDNESS: single 

y (D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
^ TGAAGACGCC AGAGAAGCAG 2 0 

'fl (17) INFORMATION FOR SEQ ID NO: 17 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
GCCTCACAAG TGTCAGACCT 2 0 
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(18) INFORMATION FOR SEQ ID NO: 18 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
AGAAGGGTGG TGAAGACT 18 

(19) INFORMATION FOR SEQ ID NO: 19 
•if (i) SEQUENCE CHARACTERISTICS: 
t^J (A) LENGTH: 20 base pairs 
U} (B) TYPE: nucleic acid 

W (C) STRANDEDNESS: single 

Ni: (D) TOPOLOGY: linear 

ip (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

JJl CTTGGTTAGA GAGGATGGGC 2 0 

(20) INFORMATION FOR SEQ ID NO: 20 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
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GCCCATCCTC TCTAACCAAG 2 0 

(21) INFORMATION FOR SEQ ID NO: 21 
(i) SEQUENCE CHARACTERISTICS: 

(A) L,ENGTH: 15202 nucleic acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(A) NAME/KEY: 

(B) LOCATION: 

•f! (C) IDENTIFICATION METHOD: 

(D) OTHER INFORMATION: /note="N is unknown or other" 
ffj (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 21 

fxl GATCCGAGCTCAATTAACCCTCACTAAAGGGAGTCGACTCGATCCTTAAA 

ATATTCATATCTCCTGGACAACCTGTGGCCATAGTGCCTGACTGTAAACC 
p CAAAGGGTTTGCCTTTGCCAGTGTAGCCCAGCCTGGTGTCTGCTGCCCCT 
.fj CGCGGTGTCTGTGCACCTGCCACGATGCTGACCAGACACCCTTAACCAGG 
•^^ TTCACCCATCGCCTGGGCCTGGAGCAGTCQCCCTGATGCTCTGATTGGTC 

CTTGGACCTTCTGTTCTCCCAAAATCCCAGGTCAGAAAATACCTGGAAGT 

i 1 i 

CTATTTGTGTCCCACCTCCCTCTTTGTGGCCGCAAGTGCCCCTTCCTCCA 

i \ '■ ^ 

CACAGTCACAAGACCATGAGATGCCATCTCCTCCCCTCCTGGGCTGCAGA 

1 ' i 

' i i 

CTTTGGGAAGCTCCCAGGCCACAGAGGTGTCAGCTCCTGTCCAGGCCCTT 
GGGACCTTCCCTCATTCAACpACCCTACCCAACCCCCCACTGCCTGCCAG 
CCACCACTCCCTCCCACATTTGCAGGCGGGbGCCCTGCCCTCTCCTGCCG 
CTGGTTCCCCTACCCAGGAGGCTCTCCCATCGCTCTTTTGAGAGTCTGCC 
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TCCCACCTCTAACTGGGGGCTTAGTTCAAGTTGCCCCCTTACCCTAGTCC 

! ! 

CAGCTGCCCAAGAGCTTGCTGCCTCCTGTTCTTGGTGAGGGACTCCAGAG 

i I 

ACAGATGTGAGACCTCCCTGGACCCCTCCAAGGCATTCCCAGGTCACTTC 
CATGAGTAGTGAAGAACCGCCTCTGAGCAGGCTGAGCCTCCCTCAGCCTA 
TGGTGTCCTCACGTGGCTTGGCCCACAGCAGGTGCTCACGCCTCCTCCTC 
AGCAGAGCCTACCATCCTCCTGCCATGCTCACCAGTCCCCATGCTGATAG 
CCATCACCAGTCCCCATGCTGATAGCCATCACCAGTCCCCATGCTGATAG 
CCACTTTCTGGATGCTCTAGGTCTGTCTGGATGACACAGTGACCACAGAG 
AAGGAGCTGGAGACTGTGGAAGTGCTGAAAGCAATTCAGAAAGCCAAGGA 
GGTCAAGTCCAAACTGAGCAACCCAGAGAAGAAGGTGGGTTTGTGTGGCA 
GGTGGGAGGGCAGTGGTGCAGAGCCAGCCGGGATAGGAGCCAGTTCGGGG 
GGCTTGGGCCATGGGACTGCTCAGGGCTGCCGAGTCCCAGCTGCGCCCCT 
M CCCTGGCTGCATGACCTCGGGCAAGTCGCGGCCTCTCTGTTCTCTGTGGG 
yi GTGGGGACAGTGGTAGTTCCTGCTCTAAGGATATGATGAGACCATCTTTA 
IxJ CCACCCAGTTGGTGGGAACCGTTGCGCTCCCTCCTCACACCCCTGGCCTT 
GGGGAGCTCTGTGCTTCCTCTTCTCTCCCGGGCTGACTCAAGCACTCGTC 
CTCAGGGTGGTGAAGACTCCCGGCTCTCAGCTGCCCCCTGCATCAGACCC 
% AGCAGCTCCCCTCCCACTGTGGCTCCCGCATCTGCCTCCCTGCCCCAGCC 

. ?t 

=y CATCCTCTCTAACCAAGGTAATCGTGTATGTATCTTGCTTCTAGTGGAGC 
CACACAGCCCTGCCTGGGCCCCCTGGCTGGGCTGGGGTTGGGGGAGAGGT 
GCCAGCACCTGCTTCCAACAGGGTCAGACACAGGGAGGGCAGTGCCTTCT 
GCAGGCTGGTCCTCGCGGGGGGACACATGGCAGGGGTGCCTGGCCTGATG 
CCAGCTGTTGCTTGCTTGGTGAGGACTCCCAATTGCTCTGATGCCCACAT 
CCAGCTCCTCTAGGAGACCGCAGGGTGTCTGACAGGCCCTGAGGCTGCCC 

TCTGAACAGGCTCGGGGCTGTTGGCTCATGGGACCCATTCCCTCACCGGC 

! i 

AGCACAAGCAGGTTGGCTCCTGGTTACAGGAAGCCGGGCTTGTGACTTTA 
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% *• 

CTGTCTGGAGCCCGAATCCGTGTGCAGGGAAAAGCTTGCip 
CCTCATCTCTGTGGGGTGACCCAGCCCCAGAACACCATGTTTGTGGGGCC 

! I i 1 

AAGATGGGCCATCTCTGTcdcTGTGGACCCATGGAAGACCAGGCCCATTC 

ill' 

GTCTGCCCACTATCTTAGCGTTTTCAAAGGGCTTTCACCTCTGAACCCAG 
GCATCCTCGGAGATGAGTGAGTGAAGCAGGTCTCATGAGCGTGTCTGCTG 
GCCCGGCCCCCACGGAAGAGGGGAGGGTGTpCCGTCCCGAGTGGAGCCGA 

GGCTCGGGACACGCAGGAAAGGACGCCGCCTGCCCGGGCTCCTGGAGACG 

i 

i ! 

CAGAACTTGGTGTGAGGTCTTGGGAAAACAGTTCAACCCGATGTTTTAAG 
AGCCAGAAAAACATTCCCACCCCTTGACCTGGTAACCCCACTGGTGGGGA 
TTTTCTCTTAGAGGGATAAGATACCGGGAAGGGGAGGTGAAATGCTCACC 
ACTGCCAAAACACGGGCTGCAACTGCAACATCGGAGGATGAGAGGGAGAG 
TCGGCTGTGGTGCAGAATGCTCAGCAGCCCTCCCAGCAGGGACAGGAAGA 
^ CTGGGCAGGAAGAGGGGAGAAGCATTCAAGTTAAGGCAAAAGGCCCAACG 

tJ] cagagcagcacactgaggtc'acacctgtgagatgtggaagagaattcctg 
y agcgtggagcgatggggttaggtgccaggatgattgcccattttgcttct 
U gtcagactcttgactaaggatttctggttgcattttattacataaaagcc 

o 

Q agggaggttatatcacggtgagaaagcttccctgacgccgcctcctgtag 
cgcagccaagcgagcctgtggaggtaccatatgactgtaggcctctgggg 
acagggagctgcatctgcttctcaaggccagggacacagccatttctgcc 
agcatctgttgatcagtgagtgagtgagtgggcaggtagagcaggagcca 
gtgaagagcaggccctggatgggtggggatgcaccatgtccccaggctgc 
agctgcaggcagccccccacattgtcggagaagcctctggaccagctcag 
ccccctcctcactccccttgtgccctggggacactctgcagaggggcact 
ctgcagtctgtccccgccatcgctggacttctggacatgdcctccagatt 
tgcacctcttaaataaatctgcagtggatgtctttgtgtgcacctctctt 
tccttttggtgagaaacagcaaagatcggacccctaaggactctcctgat 

i 
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r 1 t I 

GTCTCCGCTCTATCCGCTGAGTGCCCTTTCpGACCACTTGrTTGTACAGG 
CCACGGTCCAGGACGGGAGCAGATAGACTGTCCCTGTCCqTGTCCACATT 

I ^ i 

TCCTTGGTCCAAACAGGGCTTGTGGGAGGTAGTGGCAAAAGGTGTTGGTC 
TTTTTCTCACTGATTTGGAGGCCTCCCCGTGTGTTTTTTCAGCCGCGTGT 
TCCTGGGTCTTGCCTGGATGGACAGGGTTTTTTAGCGCGTGGGAGCAGCT 

! 

: "I 

TTGCTGACCATGCCTGTTGCTTCCAGCCTGATTCCCGAGAAGGGAGCGTG 
CTTGCGAAGGAACTGGCACTCGGGCCTGCCTGAAGGGGGCGCTGTCCAGA 
CACACCCAGCCTCCCGTCGTGGCAGGCGCTGTCGGAGCCATGGATGATTG 
TGACCAATAGGGGTGGTCGCCAGAGTTGATTGTCCAGCCAGGCCCAGGGG 
CTGAGAGGAGGCTGTGTGGAGAGGTGGTTA'GGAGCCAGGGCTCGGTCAGC 

tgagttcgcatgccagcttcctagctgtgggacctcaagdaacttgtagc 
ccctctgaagctgttttctcaactgtgaagtggacgcaccctacttcatt 
gattctaagaggcacgcatttccaccttgtgacttctctgaaactgaggt 
gcgtctttcagtcagtggcgtctcatagtcgctgtcagccagctggtatt 
cgagatggagtcgtggaaaacccgtggacaccttccgcta;ggaccaagat 
ggcgccacctgccgcatcttagatttgatgaaatgtggtaaataacgaga 
ggcatgcatgagcgaatgctggggaggcgcttggcactac'ccagagctcc 
acagaggtggtcgatgagggctgccctttcccacatccttagtagggggt 
tcaagatgacccagactgtgcccctggggagcttggagccatgcgggagg 
atgagccatgtgctggaggagaacagggtaggatggtgtggggcttttgt 
agactgtctagagcagagaaggtctgcagtggaggtggtcirctgaggtga 
atctcgaaggtgaataggagttgaacgttagcaggcagagggtggattgc 
aggagagcagcggcctgggcaggtgcccagcgtggcccattagggtgctt 
catgcatggctgtgtgcttgccatccttcctgcctgcctaccccctgctg 

CTTCGCTTCATGGGGGCGTTTGAGCTTGGGCCCACCTGCCTGCCTCGCTT 

gtgggcagaggacccaggctgtgtgagttgtcctgtcccggggagcagct 
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GAGCTTGTCCGGGTTCCTCG|vCCTGTGGGGtTTCAGAGGAfcTTCGGGTCA 

TTTCAATGGGCTGTGGCGATGCTGGCTGTGGAGGTAGCCTAGGGCTCCTG 

■ i 

TAGCCTTCAGTGAGACTGGCGGCCCGATGCCCAGTGTTCACCCTGCTGGC 
GGCAGTCAGGAACATGTTCACAAAGCTTTACTTCAAGTGGTCTAGAGGTG 
ATCTGAGGTGGAGTAACAGGTCCAGATAGGCTACGTTCATAAAACAGCTT 
CAGCGGGGTTTAGGAACACTGTGCATTTACGGGACGCAGTGGGTCAGAGT 
GCTGCTGTCCGTGGGAGGTGGCCCCAGGGCAGGTCAGTGGGCACGTCCTG 
TGGTAAGTGGGACTGTGGATGTGGGCTCAGGCTGGACTCAGC'AGCCCTGC 
TGGATACCAAGGCCTGCAAGGGCTGGCCCCCTGGTGAATTGTCCCGTGCC 
CTGTGTATCTATGAGTCCTGCAGAGATGACAAATCAGGGGACGGGGTCAT 

GTCTAGTCACCGTCTGGGAAAATGCTCCAGGAGTGAACACATTTCAGGCT 

y / 

-.J, CTTGATGGATGTACCTCCAAACTGTTCTCTGGATGGGTGGGCCAGCTTGC 
H= ATGCCTGTGCCGGCCTCTGCCCAGCGAGGTCAGGGCCAGGCCA.CACAGTC 
iji AGTCTGACTTTGGCAGAAGTTGAGAGGCAACACTTGTCTCTTGtTTCAGC 
IxJ TTGCCTTTCTTTGTGTACTTCTGAGAGCGAGCATTCTTTTCATGTTCTAT 
CCGCTGGCCGTTCTTCTGCGGAATGTCTGTTCACGTCCTTTGCAGTCTGT 
□ TAATGAGGTTTCCAACCTTCCCTCATTTTTGTAATCTGTAAGAACTTTTT 
:JI CCAGACTAGCGATATAAATCCTTGTCAAATATTGCAAACACTTTTCTCAT 
TTCATCTGGTTTTAATCTATCCTGGTTTTTAAAAAATGTGTCTGTGGAAG 
TTTAATTTTTATGTAGTCACATCTCAGTTTTTTTCCATTGCATTTATTCT 
CAGAATGCTTCTCCCTGCCCTGAGATTAGATAAGCAGTCATTTGTTCTTT 

CTTGAGTTATTTTGAGATTTCAGTTTTAACATTTTCTTCTATAATCCATG 

i 

TGGCTGGGTTTTGGGATCTGGCTAACCCCCGCCATGCCAGTAGCCTGAGG 
GGCCCAGCCCCACTTGTTGAACAGCCGCTCTCCCCGCCCCACCCACCCTG 
CCTGCCTGCCCACCCGCCCTGGTCTCTCCAGGAATCATGT|rCGTTCAGGA 

i 

ggaggccctg'gccagcagcctctcgtccactgacagtctgactcccgagc 

! ! 
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ACCAGCCCATTGCCCAGGGATGTTCTGATTCCTTGGAGTCCATCCCTGCG 

GGACAGGTAATGCCCTCTTCCCGCTTCTGGGGACCATACATCTGTGGGTG 

i i 

GACTCTTCTGCTTGGGGTTGTGTGCAGTAGGAAGTGGCCTAGCTGGAGCT 

GAGGCAGATGCTTCCAGGGTTTGGCGTCCTCTGCTTTGCGCCACGGTCTT 

i 

TCTCTTGGACCTGTCTCTGGTTGAGTGTCTTCCTGACAAACACAGTGGTT 
AAGGGTTTATTTTCAGCCTCCCTCCTTCCCTTCCCCACCCACCTTGGTTG 
ATGGGAACAGGCAGTTCTCTGTCACTGGGCCCAGGGCACGAGGGGGGCAG 
GTGGAGAGGGTGGCCCTTGACCCTGTGAGCAGGCTTCCCTGGGGAAGGCA 
TTTCAAAAGACCCTCGTGCAGGGGCTTGTTTGGGTTTCTTCTCTGTTTCC 
TGGCACCCCTGGAGCCACTCGGCGCCTTTCCGCATGTCACCCTGGTGGTC 
TGGGAAACAGTCTCACTCTGGCGCCTCCTCTGTGGTTGTTACTGAGAGTT 
CTGGGGCCCCTTCCTTTGTCCTGAGGAAAGACAGGAGGAAAGCAAGGGTG 
\^ CTTGCTGTGTGCTTCGCAAATGTGCTTGGTGCCTGGGCCTCCCTCCAGCC 
fjl CCATCTCTGCAGCAGCACAAGGTTATGGCCTTGTGACACTGGGACAGTTT 
1x1 GCAGAGTCCTTGTCTGTCCTCAGTACTCCACAGTATTCTGCCATCACCCT 

1^ TTCCAGGGTCACACAGCAAGAGATTCCCAAGCCCTAGGTATTCCCCAGTG 

I'll 

Q CACAGAGACCATTGGGAGGGACTTGCCAGGGCTGTGTCCACTGCTGGCCA 
J; GTTAGGGTCGGACCAAATTTGTAGACTGTCTACCTGGACCCTTGCGTGGC 
ACAAGGAGCAGTCAGATGCTGGATCCCTGGAGAGTGGCGAGAGGCTCTGG 
CCTTAGGTTGCGAGTGGGAATCCCAGCCCTGCTGTGTGCTGGTGGGATAA 
CCAAGTGGGTCTCTGCCCTTGGGTCCCAGAGTGGGCCCCAGGGTCCCAGA 
GTGGGCTCCAGGGTACAGCGTGGGGATGGGGAGCCTCCTCAGGGCGGTGA 
TGGAGGGCAGAATGCCCAGCTCAGGGTCTGGCAACCAGTAAATGGCTGGG 
GCTGGCTGCAGTAGGTGGGGACTGACTGTGTTTCTTTCTCCATCAGGCAG 
CTTCCGATGATTTAAGGGACGTGCCAGGAGCTGTTGGTGGTGCAAGGTAA 
GGAAGAGGTTGGAAAGGGACCTGGGCCTGGCCACACAGCCTTATGCACAC 
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ACACTGCTGTGGGCCAGGGGTGGCCAGTCAbGTTTTTTTAkAAATCCGTT 

I ! 

CACAGAAGGCbTATAGAACTATTTCTTCCTCTAAAGAGAckcAGATGAGA 

i I 

tggacttttcaatctgtttccaaattctaatacctaaact::tgctcagca 

I i I 

CATGTTGCCCTACACCAGGGqTTGGCAAATCAAGGCCTGTGTGTGGCCCA 

CAGCCTGGGAGCTAAGAATG'ACAGTTACATTCTTTTTTCTrTTTTTGAGA 

i 

ctgagtctcgctctgtcgcdcaggctggagtgcagtggcgtgttcttggc 
tcactgcaacccccgcctcdcagattaatgcaattttcctgtctcagcct 
cagccttctgagtagcccggaccacaggcgcacgccacca'cgcccaacta 
attttttat^^tttttagtagagacagagattcaccatgtc gcctagctgg 

TCTCGAACTCCTGAACTCCAGTGATCCACCAACCTCGGCTTCCTAAAGTA 
CTGGAATTACAGGCATGAGCbACCGCGCCTGGCTAGAAT^ACAGTTACTT 

i 

tttttttctttgagactgagtcttgctttgtcacccaggc:tggagtgcag 
tggcacgatctcagctcgctgcaacctccgcctcccgggttcaagcgatt 
cttctgcctcagccacccaaggtgcccgccaccacacctggctaattttt 
ctgtttttagtagggacaggatttcgccatgttggacagtp?acattctta 
aagggctgctgaagatcgtatggacatggtagcccataaatcccaaaatg 
tgtactctgaccctttacagaagcttactaactcccactctacatgtgag 
ggctgcggtggccaagaagagctggaatttaagtgtgaaggtcctaagac 
ctgccccagcccacttccctgccccggaggccaccaggggtgacaagtag 
attcatgccctggagtgttccttctctccggggcttatggcagcaactga 
atgacttagaagtccatgggagtgctttctgttgtgggaactcgtgtggt 
ctgggcatagctgtgccaggcacctatggtccaagcccctagaagcatag 
actctgaccaaactggcgacccagccttccagcaggcagcactggctccc 
accagggccctcatcctgggaactgacttggccatgtgggaggcttggga 
gacccatgggttggtttctcagggtcagggtgtagcagtgggctccagat 
gtggcaggtgggaggtgggaggggcccctcccagcatgccactgacctgg 



84 




CCTCTCCCTG(:ACAGCCCAG?^CATGCCG;^bcCGGAGGTC|:AGGTGGTGC 
CGGGGTCTGGCCAGATCATCrTCCTGCCCTjTCACCTGCATTGGCTACACG 

! I 

GCCACCAATCj^GGACTTCATCCAGCGCCTGjAGCACACTGATCCGGCAGGC 
CATCGAGCGGCAGCTGCCTGiCCTGGATCGAiGGCTGCCAAcbAGCGGGAGG 

! ! j \ ' 

AGGGCCAGGGTGAACAGGGCGAGGAGGAGG-ATGAGGAGGAGGAAGAAGAG 

i • • I- I ■ 

GAGGACGTGGCTGAGAACCGiCTACTTTGAAATGGGGCCCCCAGACGTGGA 

;! i I ^ 

GGAGGAGGAGGGAGGAGGCCAGGGGGAGGJ^GAGGAGGAGGAAGAGGAGG 

: > 1 : 

; " i ! ; 

ATGAAGAGGCCGAGGAGGAGCGCCTGGCTdTGGAATGGGCCCTGGGCGCG 

\ \ 1 

GACGAGGACTTCCTGCTGGAGCACATCCGCATCCTCAAGGTGCTGTGGTG 
CTTCCTGATCCATGTGCAGdGCAGTATCCG'CCAGTTCGCCGCCTGCCTTG 

i ' ! \ 

TGCTCACCGACTTCGGCATCGCAGTCTTCGAGATCCCGCACCAGGAGTCT 
CGGGGCAGCAGCCAGCACATjCCTCTCCTCC'CTGCGCTTTGTCTTTTGCTT 

cccgcatggcgacctcaccgagtttggcttcctcatgccggagctgtgtc 

tggtgctcaaggtacggcacagtgagaacacgctcttcattatctcggac 

; i ; 

GCCGCCAACdTGCACGAGTTiCCACGCGGACCTGCGCTCATGCTTTGCACC 

I I ; " . 

CCAGCACATGGCCATGCTGTjGTAGCCCCATCCTCTACGGCAGCCACACCA 
GCCTGCAGGAGTTCCTGCGCCAGCTGCTCACCTTCTACAAGGTGGCTGGC 

i ; 

GGCTGCCAGGAGCGCAGCCAGGGCTGCTTCCCCGTCTACCTGGTCTACAG 

; ! ; ■ 

TGACAAGCGCATGGTGCAGACGGCCGCCGGGGACTACTCAGGCAACATCG 

; i 

; i 

AGTGGGCCAGCTGCACACTCTGTTCAGCCGTGCGGCGCTCCTGCTGCGCG 

i I : ^ ■ 

CCCTCTGAGGCCGTCAAGTGCGCCGCCATCCCCTACTGGCTGTTGCTCAC 

! i 
GCCCCAGCACCTCAACGTCATCAAGGCCGACTTCAACCCCATGCCCAACC 

^ J ^ : 

gtggcacccacaactgtcgcjaaccgcaacagcttcaagctcagccgtgtg 
ccgctctccaccgtgctgctggaccccacaLgcagctgtacccagcctcg 

i ; ' 

GGGCGCCTTi'GCTGATGGCOACGTGCTAGAbcTGCTCGTGGGGTACCGCT 

I i ; 

TTGTCACTGCCATCTTCGTGCTGCCCCACGAGAAGTTCCACTTCCTGCGC. 
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GTCTACAACC^GCTGCGGGCCTCGCTGCAGGACCTGAAGAbTGTGGTCAT 
CGCCAAGACCCCCGGGACGGGAGGCAGCCQCCAGGGCTCCTTTGCGGATG 



GCCAGCCTGCCGAGCGCAGGpCCAGGTGAGATCAAGCACAGCTCTCAGGG 

: j I . ■ : 

gccccgggggcacgggtctgIgcatgtgtgtgatctcagcatctgcggcta 



GTGTGGGCTGGGAGTTGCTGCGAGAGCTGGGCCCCCTCCCbcCTGCCCCT 

: " I j : 

CGCCCCCCCCGGGCCTCCCT;CTACATCACqACCCCAGGTTTGGTGCCAGG 
CTGCTCCTTATCTCAGTGCTGTAGAAGAAGCCCAGGAAAGCTGTCCTCTC 
ACAAAATGGGTTGGCCCAGCCTCTTGCCACCCATGAAGGGCAGGCCAAGG 
GGGCTGCCCCACCTTTGCCTGCCCAGTGGGAGAGCAACAGGCTGCAGCAC 
ACCGAGGCCAGGAGAGCTGTCACCCTGGCTGCTGTGCTCCTCTGGGCCCA 
g«i AGCATGGCCTCTGGGCACTACCTCCTCCAGGGTCACAGTCCCACGGATGG 

'S; CTCTGTGGGGCAGGATCTGCCTTAGGCTTCACCCACCTCAACATCTTGCT 

-T' ■ : ; . 

GTGTTGTTCAGGCTGGTCTCAAACTTTGGGCTCAAACAATCCTCCGCCTC 
^ AGCCTCCCAAAGTGCTGGGATTACAGACATGAGCCACCGTGCCCGGCCGT 
^'^^ GCTGTTCTGTTCTCCAATAGAGAAGCTGGTGGAAGTCCCCAGTAACCCAG 
lf= AGGTGATGTGTGATGCACACAGTCTCCTCACTCTGAAGCTGCACATGCGA 
O TGTGAATCTTCATTTGGGGTCCGCTGTTAATATGGTGTTTTTCGGGGGAT 
=fl ACAGCAATGACCAGCGTCCCCAGGAGGTCCCAGCAGAGGCTCTGGCCCCG 
GCCCCAGTGGAAGTCCCAGCTCCAGCCCCTGCAGCAGCCTCAGCCTCAGG 
CCCAGCGAAGACTCCGGCCCCAGCAGAGGCCTCAACTTCAGCTTTGGTCC 
CAGAGGAGACGCCAGTGGAAGCTCCAGCCCCACCCCCAGCCGAGGCCCCT 
GCCCAGTACCCGAGTGAGCACCTCATCCAGGCCACCTCGGAGGAGAATCA 
GATCCCCTCGCACTTGCCTGCCTGCCCGTCGCTCCGGCACGTCGCCAGCC 

i : i 

TGCGGGGCAGCGCCATCATCGAGCTCTTCCACAGCAeCATTGCTGAGGTA 

; , : i 

GCGGCCCGGGTGTGGGTGCCAGCTATGGCACGGCCAGTCCTGAGGGCGAG 
GCCAAGCTTGGCTTCAGGTCAGCCTCAGGTCCCTGGACTTCCCTGATGTC 
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GGAGTCCTCAGCTGAGCTGcjrCACAGCTTTGAGGACCTGGlSCAGTGAGGT 
CCTGAGTTGCCCTCCCTGGCCATTTGTGCTGTGTCACCACCTCCTGTGCC 

ACTTCCAGCCCCAGGTAGACCTCCCACCAACAGCCATCTCCCACCCCTCT 
CTTCCTCTCTGCCTTGAAGCATACGGATTCATTGGTGAGCCAAGAGGGGC 
TTCCCATGTCTCCTTGTGGAAGCTGTGGGCATGTCCCTGGTATGTGCAGG 
TTGCTAGGGTGGTGGAGCTGACAGGAGGCCCCCCGTCTTCAGGTTGAAAA 
CGAGGAGCTGAGGCACCTCATGTGGTCCTCGGTGGTGTTCTACCAGACCC 
CAGGGCTGGAGGTGACTGCCTGCGTGCTGCTCTCCACCAAGGCTGTGTAC 
TTTGTGCTCCACGACGGCCTCCGCCGCTACTTCTCAGAGCCACTGCAGGG 
TAGGCACAGGGCCTGCTGGGGCTCAGGAGCTTGGAGTGTGTGGTTGGGGC 
AGGCCTGGGGGGTCATTCTCTGGAGCCAGCTGTGTGGCTTCAGGCAGCAG 
TCAGCGACTTGGCTGCAGTGGGCTGAGAGTTCCTTGTCTGAGGAAGGGAG 
CTGTCATGAGGGAGGGGTCCATGGCCAGATGTGAACGCAGAATGCACTGA. 
GCCAGGGCCTGGTGACTGCTTGGGAACAGCCTGTGATGAGAAGGGGTTAG 
Ul GCAGCCTTTGCCCCTGGGGCTGCACAGGAAGCCCTAGCCAGCGACCTGGT 
GACTCCCCTGAGCTGGAAGAGGCTCAGACTCCAGAGGGCATTGCCTATGG 
i! GGCTTTGCACGGGTGGAAGCCAGGCCAGCCAAGAGGACCTGTTCCTGCTG 
!p GATGTGCTGCACACCTAGGAACCTTGTGCTTGCCTGCCAGCGCCTCCCTC 
TGTCCCTTTCTCCATCACACAGATTTCTGGCATCAGAAAAACACCGACTA 
CAACAACAGCCCTTTCCACATCTCCCAGTGCTTCGTGCTAAAGCTTAGTG 
ACCTGCAGTCAGTCAATGTGGGGCTTTTCGACCAGCATTT^CCGGCTGACG 
GGTGGGTGACCCTCTGTGCTTTGTCCTATTTCGGGTGAAG^GCCAGCATCA 
CCAGTGGGCTTCCACCTTCCGTACGTGGGTGGGTTATCATAGACAGTTAT 
CTCTGTGCTCAAGAGCCACTTCTTACCCGGGGTGGGAGGAkGCAGCTTCA 
GGAACTGCTGAGAGAGCAGAACTCACGCTCCAGGGCTCAGAGCAGGAGGT 
AGGGTGTGCGGCAAGCGCTGGCCCGGACAGAAGCAGAGTGpGCCCTGGTC 
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TCGGGCAGGATGTTTCTGACTCACATTTCCjrGAGGAGAGAAAGCTAAGCT 
CTTTGCCTAATGTCTCTGTCTCCCCTTCCAGAAAAATGCCTCAGCTCTTC 
CGGCCTGAAGGAATGGCCTCCTCCCGGGCC'qCATGATTCTTTCCTGTGTG 
GGCCCTCCTGGCCCTGGCCTCTGGGCTGAGGCTTGCTAGGGACTCGGGGT 
GGCTCTAAGGGGCAGGGATAGGGCTGGGGAGCGCCGGCCTpTGGCCCTGA 
CCAGCCCCTTCTCGTGCAGGTTCCACCCCGATGCAGGTGGTCACGTGCTT 
GACGCGGGACAGCTACCTGACGCACTGCTTCCTCCAGCACCTCATGGTCG 
TGCTGTCCTCTCTGGAACGCACGCCCTCGCCGGAGCCTGTTGACAAGGAC 
TTCTACTCCGAGTTTGGGAACAAGACCACAGGTACCCCTGTCTAGCTCAG 
GCTGCAGACAGGCTGCCTGGACAGACGTCATGGGCCCCAGGGTGGCTCTC 

ri TGTGCCCCAGAACCCTCTCTGCCTCTATGTCTCTCTTTTCTCACTTAGCT 
GGCCAGGGTTTTATGTGGGGCTTTTCGATGGCAGAGTCTCCACTCCAGCA 

% GTCCCTCAACCATCTGGCAGACACATCTCCAGTGCCTGCTTTGGGCTCCT 
GGCCTGTGGGCCCCACACTTGG AGCATCCTCTCCTGCCTGTCTCATGCCG 
GGGTCTCTCGGTTGGCTTGGGGCCCTTGGTGCTCCCAGCCCCACCAGGGG 

^ CCGGTTCCAGGCTATAGCCCAGGTGGCATCTCTCTGCAGGGAAGATGGAG 

CI : 

□ aactacgagctgatccactctagtcgcgtcaagtttacctaccccagtga 
ggaggagattggggacctgacgttcactgtggcccaaaagatggctgagc 
cagagaaggccccagccctcagcatcctgctgtacgtgcaggccttccag 
gtgggcatgccaccccctgggtgctgcagg'iggccccctgcgccccaagac 

t 

ACTCCTGCTCACCAGCTCCGAGATCTTCCTjCCTGGATGAGGACTGTGTCC 

i : 

ACTACCCACTGCCCGAGTTTGCCAAAGAGCjCGCCGCAGAGAGACAGGTAC 

i : 

CGGCTGGACGATGGCCGCCGCGTCCGGGAcbTGGACCGAGTGCTCATGGG 

i 

CTACCAGACCTACCCGCAGGCCCTCACCCTCGTCTTCGATGACGTGCAAG 

i 

■ ! 

gtcatgacctcatgggcagtgtcaccctggAccactttggggaggtgcca 

i * 

ggtggcccggctagagccagccagggccgtgaagtccagtggcaggtgtt 
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TGTCCCCAGT'GCTGAGAGCAGAGAGAAGCTfcATCTCGCTG(rTGGCTCGCC 

! ! I i ■ 

AGTGGGAGGCCCTGTGTGGC'CGTGAGCTGCCTGTCGAGCTCACCGGCTAG 

■ : : I ■ 

cccaggccacagccagcctgtcgtgtccagcctgacgcctkctggggcag 
ggcagcaggcttttgtgttctctaaaaatgttttatcctccctttggtac 
cttaatttgactgtcctcgcagagaatgtgaacatgtgtg'tgtgttgtgt 
taattctttctcatgttgggagtgagaatgccgggcccctcagggctgtc 
ggtgtgctgtcagcctcccacaggtggtacagccgtgcacaccagtgtcg 
tgtctgctgttgtgggaccgttgtt/ilacaqgtgacactgtgggtctgact 

: j ; 

TTCTCTTCTACACGTCCTTTCCTGAAGTGTCGAGTCCAGTCCTTTGTTGC 

TGTTGCTGTTGCTGTTGCTGTTGCTGTTGGCATCTTGCTGCTAATCCTGA 

GGCTGGTAGCAGAATGCACATTGGAAGCTdcCACCCCATATTGTTCTTCA 

AAGTGGAGGTCTCCCCTGATCCAGACAAGTGGGAGAGCCCGTGGGGGCAG 

GGGACCTGGAGCTGCCAGCACCAAGCGTGATTCCTGCTGCCTGTATTCTC 

TATTCCAATAAAGCAGAGTTTGACACCGTCTGCATCTTCTAAACCAAGGG 

TCACTGGGATCGAGTCGACGGCCCTATAGTGAGTCGTATTAGAGCTCGCG 

i ; ; 

GCCGCGAGCTCTAGATGCATGCTCGAGCGGCCGCCAGTGTGATGGATATC 

TGCAGAATTCCAGCACACTGGCGGCCGTTA-CTAGTGGATCCGAGCTCCAC 

agaggtggtcgatgagggctgccctttccdacatccttagtagggggttc 
aagatgacccagactgtgcCcctggggagcttggagccatgcgggaggat 

i I 

gagccatgtgctggaggagAacagggtaggatggtgtggggcttttgtag 

\ i ■ 

actgtctagaagcaaagaaggtctgcagtgpaggtggtgtctgaggtgaa 

tctcgaaggtgaataggagttgaacgttag:caggcagagggtggattgca 

! 

■: i 

ggagagcagcggcctgggcaggtgcccagcptggcccatcagggtgcttc 
atgcatggctgtgtgcttggcatccttcctJgcctgcctaccccctgctgc 

TTCGCTTCATGGGGGCGTT'JGAGCTTGGGCCCACCTGCCTGCCTCGCTTG 

1 

TGGGCAGAGGACCCAAGCTGTGTGAGTTGTCCTGTCCCGGGGAGCAGCTG 
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AACTGGTCCGkGGTCTCGAAbTGTGGGGCTCAAAAGGACTCCGGGGTCAT 



TTCACTGGGGCTGTGCCGATTCCTGGGGGCTGTTNGGAANGTAAAGGCCT 

1 i ■ 

: I 

AAAGGGGCTCfcCTGGTTANGpCCCTCAANTTTAANAACCTaGGGCCGGGG 

1 ! i 

CCCGGAATTGCCCCCAANTTTGTTTCAACKCCCCTTGGCCfTTNGGCNGGG 

I i i 

GCAAATTTCGANGGGGAACCAATGGNTTTCCCCCAAAAANGGGGCCNTTT 

I 

taacccntttccaaantttgbgncctaaaajaagggtgganp:tcctgaan^ 
gg 



(22) INFORMATION FOR SEQ ID NO: 22 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1070 amino acids 
'If (B) TYPE: amino acid 

^ (C) STRANDEDNESS: single 

^ (D) TOPOLOGY: linear 

U (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

1=^^ VCLDDTVTTEKELDTVEVLKAIQKAKEVKSKLSNPEKKGGEDSRLSAAPCIRPSSSPPTVAPASA 
a SLPQPILSNQGIMFybEEALASSLSSTDSLTPEHQPIAQGCSDSLESiPAGQAASDDLRDVPGAV 
.fS GGASPEHAEPEVQVyPGSGQIIFLPFTCIGYTATNQDFIQRLSTLI^RQAIERQLPAWIEAAN^ 

si's I ^ 

egqgeqgeeedeeeeeeedvaenryfemgppdveeeegggqgeee^eeeedeeaeeerlalIewal 

M t ■ j 

gadedfllehir^bllkvlwcflihvqgslfeqfaaclvltdfgiaweiphqesrgssqhil'sslrf 

vfcfphgdltefgflmpelclvlkvrhsentlfiisdaanlhethadlrscfapqhmamlcspil 

ygshtslqef;lrqlltfykvaggcqersqgcfpvylvysdkrkvqtaagdysgniewai5ctlcsa 

vrrsccapseavksaaipywllltpbhlnvikadfnpmpnrgthncrnrnsfklsrvplstvlld 

ptrsctq^rgafadghvlellvg^rfvtaifvlphekfhflrvynqlraslqdlktVviaktpgt 

ggspqgs'fadgqpaerrasndqr'pqevpaealapapve\^apapaaasasgpaktWpaeast 

lvpeetpveapapppaeapaqypsehliqatseenqip.shlpacpslrhvaslrgsaiielfhss 
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IAEVi^EELRHIJ^SSWFY;QTPGLEVTACVLLSTKAVYFVLHDGL^ 
NNSPFHISQCFVLKLSDLQSVNVGLFDQHFRLTGSTPMQWTCLTO 
ERTPSPEPVDKDFySEFGN;kTTGKMENYELIHSSRVKFTYPSEEEIGDLTFTVAQKl^ 
LSILLYVQAFQVGMPPPGCCRGPLRPKTLLLTSSEIFLLDEDCVHYPLPEFAKEPPQRDRYRLDD 
GkRVRDLDRVLMGYQTYPQALTLVFDDVQGHDmGSVTLDHFGEVPGGPARASQG^ 
S AESREKLI SLLARQWEALCGRELP VELTG 
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